# 



SEQUENCE LISTING 



<110> Anderson, Marilyn A. 
Atkinson, Angela H. 
Heath, Robyn L. 
Clarke , Adrienne E . 

<120> PROTEINASE INHIBITOR, PRECURSOR THEREOF AND GENETIC 
SEQUENCES ENCODING SAME 

<130> 9748B 

<140> 09/431,500 
<141> 1999-11-01 

<150> 08/454,295 
<151> 1995-09-01 



:f <160> 16 

0 . ' 

M <170> Patentln Ver. 2.1 

ry 

|S <210> 1 

<211> 1104 

Jlfjj. <212> DNA 

E ^ <213> Nicotiana alata 

El 

M <400> 1 

r . a- 

r LI aaggcttgta ccttaaactg tgatccaaga attgcctatg gagtttgccc gcgttcagaa 



60 



gaaaagaaga atgatcggat atgcaccaac tgttgcgcag gcacgaaggg ttgtaagtac 120 

ttcagtgatg atggaacttt tgtttgtgaa ggagagtctg atcctagaaa tccaaaggct 180 

tgtaccttaa actgtgatcc aagaattgcc tatggagttt gcccgcgttc agaagaaaag 240 

aagaatgatc ggatatgcac caactgttgc gcaggcacga agggttgtaa gtacttcagt 300 

gatgatggaa cttttgtttg tgaaggagag tctgatccta gaaatccaaa ggcttgtcct 360 

cggaattgcg atccaagaat tgcctatggg atttgcccac ttgcagaaga aaagaagaat 420 

gatcggatat gcaccaactg ttgcgcaggc aaaaagggtt gtaagtactt tagtgatgat 480 

ggaacttttg tttgtgaagg agagtctgat cctaaaaatc caaaggcctg tcctcggaat 540 

tgtgatggaa gaattgccta tgggatttgc ccactttcag aagaaaagaa gaatgatcgg 600 

atatgcacca actgctgcgc aggcaaaaag ggttgtaagt actttagtga tgatggaact 660 

tttgtttgtg aaggagagtc tgatcctaaa aatccaaagg cttgtcctcg gaattgtgat 720 

ggaagaattg cctatgggat ttgcccactt tcagaagaaa agaagaatga tcggatatgc 780 

acaaactgtt gcgcaggcaa aaagggctgt aagtacttta gtgatgatgg aacttttgtt 840 



S 4 ? 



I LI 

; XT 

=4 



* 



ggtgaaggag agtctgatcc tagaaatcca aaggcctgtc ctcggaattg tgatggaaga 900 
attgcctatg gaatttgccc actttcagaa gaaaagaaga atgatcggat atgcaccaat 960 
ggttgcgcag gcaagaaggg ctgtaagtac tttagtgatg atggaacttt tatttgtgaa 1020 
ggagaatctg aatatgccag caaagtggat gaatatgttg gtgaagtgga gaatgatctc 
cagaagtcta aggttgctgt ttcc 



<210> 2 
<211> 1360 
<212> DNA 

<213> Nicotiana alata 

<220> 

<221> CDS 

<222> (97) . . (1200) 

<400> 2 



cgagtaagta tggctgttca cagagttagt ttccttgctc tcctcctctt atttggaatg 

J"jj tctctgcttg taagcaatgt ggaacatgca gatgcc aag get tgt acc tta aac 

! -1 Lys Ala Cys Thr Leu Asn 

O ■ 1 5 

clt Isn p™ ?" ??? tat gga gtt tgc cog cgt tea gaa gaa aag 



Cys Asp Pro Arg He Ala Tyr Cly Val Cys PrS Arg Ser gTu IS JJI 

10 15 - 



<t xu 15 20 



P 25 30 



35 



cct aga aat cca aag get tgt acc tta aac tgt gat cca aga att gec 
Pro Arg Asn Pro Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg lie Ala 
b5 60 65 7Q 

tat gga gtt tgc ccg cgt tea gaa gaa aag aag aat gat egg ata tgc 
Tyr Gly Val Cys Pro Arg Ser Glu Glu Lys Lys Asn Asp Arg lit Cyt 
75 80 8 5 

mS C f aC ^ gt tgC gca ggc acg aag 9gt tgt aag tac ttc agt gat gat 
Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr Phe sir lip Hp 
90 95 ioo 

2? a a K fc ^ ?^ tgt gaa gga gag tct g at cct aga aat cca aag get 
Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro tyl III 

110 115 

tgt cct egg aat tgc gat cca aga att gec tat ggg att tac cca ctt 
Cys Pro Arg Asn Cys Asp Pro Arg He Ala Tyr l?y lie Cys Pro Leu" 
±i£U 125 130 



1080 
1104 



60 
114 

162 



ttl ao^ ? gg * ta tgC aCC aac tgt tQ c aca TO<= acg aag ggt tgt 210 

Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lyl Gly Cys 



M f ag fc ^ C agt gat gat gga act tt:t <3tt tgt gaa gga gaa tct aat 

Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu lly llu Ser" AsJ 
4U 45 50 



306 



354 



402 



450 



498 



St IS? 



* 



a?* r?,1 ? 9 f 9 ? gat Cgg ata tgc acc aac tgt tgc gca ggc 

Ala Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly 
1Jb 140 145 15 £ 



f5 a ff,? ?? n Q f 9 ™ aC t ^ t agt gat gat gga act tfct 9tt tgt gaa 
Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu 
155 160 i 65 

gga gag tct gat cct aaa aat cca aag gcc tgt cct egg aat tgt gat 
Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp 
170 175 180 

gga aga att gcc tat ggg att tgc cca ctt tea gaa gaa aag aag aat 
Gly Arg lie Ala Tyr Gly He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
185 190 195 

gat egg ata tgc acc aac tgc tgc gca ggc aaa aag ggt tgt aag tac 

p o«? 6 Cys Thr Asn Cys Cys Ala G1 y L v s L ys ci y c ys l V s Tyr 

200 205 210 



gat etc cag aag tct aag gtt get gtt tec taagtcctaa ctaataatat 
Asp Leu Gin Lys Ser Lys Val Ala Val Ser 
360 365 



546 



594 



642 



690 



738 



L a o 9 ? gat %? a a ^ fc ttt: gtt tgt gaa gga gag tct gat cct aaa 786 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 

Cft 215 220 225 230 

M. 



834 



882 



»an t f g g ? ^ g cct cgg aat tgt gat gga aga att gcc tat ggg 

,* Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 

W 235 240 245 

HI a n fct ^ gC S Ca ? Lt tca gaa gaa aag aag aat gat cgg ata tgc aca aac 

He Cys Pro Leu Ser Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn 
U 250 2 55 260 

f.l| ^ gC g ? a ggc f aa aag ggc tgt aag tac ttt agt gat gat gga act 930 

Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr 
265 270 275 

^ gt gaa gga g^ tct gat cct aga aat cca aag gcc tgt cct 978 
Phe Val Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala Cys Pro 
280 285 290 

cgg aat tgt gat gga aga att gcc tat gga att tgc cca ctt tca gaa 1026 
Arg Asn Cys Asp Gly Arg He Ala Tyr Gly He Cys Pro Leu Ser Glu 
295 300 305 310 

gaa aag aag aat gat cgg ata tgc acc aat tgt tgc gca ggc aag aag 1074 
Glu Lys Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys 
315 320 325 

ggc tgt aag tac ttt agt gat gat gga act ttt att tgt gaa gga gaa 1122 
Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu 
330 335 340 

l Ct of 3 £ at 9 ? C agC aaa gtg gat gaa tat gtt ggt gaa gtg gag aat 1170 
Ser Glu Tyr Ala Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn 
345 350 355 



1220 



gtagtctatg tatgaaacaa aggcatgeca atatgetctg tettgectgt aatctgtaat 1280 



atggtagtgg agcttttcca ctgcctgttt aataagaaat ggagcactag tttgttttag 1340 



ttaaaaaaaa aaaaaaaaaa 

<210> 3 

<211> 368 

<212> PRT 

<213> Nicotiana alata 

<400> 3 

Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg lie Ala Tyr Gly Val Cys 
1 5 10 15 

Pro Arg Ser Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn Cys Cvs 
20 25 30 

Ala Gly Thr Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val 
35 40 45 

Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala Cys Thr Leu Asn 
50 55 60 

Cys Asp Pro Arg He Ala Tyr Gly Val Cys Pro Arg Ser Glu Glu Lys 
65 70 75 80 

Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys 
85 90 95 

Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp 
100 105 HO 

Pro Arg Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Pro Arg lie Ala 
I U 115 120 125 

r% 

;!% T V r G1 y He Cys Pro Leu Ala Glu Glu Lys Lys Asn Asp Arg He Cys 

130 135 140 

Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp 
145 150 155 160 

Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala 
165 170 175 

Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly He Cys Pro Leu 
180 185 190 

Ser Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly 
195 200 205 

Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu 
210 215 220 

Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp 
225 230 235 240 

Gly Arg He Ala Tyr Gly He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
245 250 255 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
260 265 270 



1360 



i.rr 



! 5f 



Phe Ser Asp Asp Gly Thr Phe. Val Cys Glu Gly Glu Ser Asp Pro Arg 
Z/D 280 285 



Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 
zyu 295 300 

lie Cys Pro Leu Ser Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn 

J1U 315 320 

Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr 
32 5 330 335 

Phe He Cys Glu Gly Glu Ser Glu Tyr Ala Ser' Lys Val Asp Glu Tyr 
J4U 345 35 0 

Val Gly Glu Val Glu Asn Asp Leu Gin Lys Ser Lys Val Ala Val Ser 
J " 360 365 



<210> 4 
<211> 24 
<212> PRT 

<213> Nicotiana alata 
<400> 4 

Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg He Ala Tyr Gly Val Cys 
5 10 15 

Pro Arg Ser Glu Glu Lys Lys Asn 
20 



<210> 5 
<211> 58 
<212> PRT 

<213> Nicotiana alata 
<400> 5 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr 

5 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
ZU 25 30 

Asn Pro Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg lie Ala Tyr Gly 
" 40 45 

Val Cys Pro Arg Ser Glu Glu Lys Lys Asn 
50 55 



<210> 6 
<211> 58 
<212> PRT 

<213> Nicotiana alata 



<400> 6 



Asp Arg He Cys Thr Asn Cys Cys 
1 5 

Phe Ser Asp Asp Gly Thr Phe Val 
20 

Asn Pro Lys Ala Cys Pro Arg Asn 

35 40 

He Cys Pro Leu Ala Glu Glu Lys 
50 55 



Ala Gly Thr Lys Gly Cys Lys Tyr 
10 15 



Cys Glu Gly Glu Ser Asp Pro Arg 
25 30 

Cys Asp Pro Arg He Ala Tyr Gly 
45 

Lys Asn 



nr 



<210> 7 
<211> 58 
<212> PRT 

<213> Nicotiana alata 
<400> 7 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
1 5 10. ■ 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 
20 25 30 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg lie Ala Tyr Gly 
35 40 45 

lie Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 54 



<210> 8 
<211> 58 
<212> PRT 

<213> Nicotiana alata 



<400> 8 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
1 5 10 .is 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 
20 25 30 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg lie Ala Tyr Gly 
35 40 45 

lie Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 



<210> 9 
<211> 58 
<212> PRT 

<213> Nicotiana alata 



<400> 9 



4 



Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
15 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Pro Gly Arg lie Ala Tyr Gly 
35 40 45 

He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 

<210> 10 
<211> 54 
<212> PRT 

<213> Nicotiana alata 
<400> 10 

% 

% Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 

% 1 -5 10 15 

4 Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu Ser Glu Thr Ala 

ii 20 25 30 

! ! Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn Asp Leu Gin Lys 

3 35 40 45 

w Ser Lys Val Ala Val Ser 

iJ 

!j <210> 11 

% <211> 13 

% <212> PRT 

* <213> Nicotiana alata 

A" 

<400> 11 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly 
15 10 

<210> 12 
<211> 13 
<212> PRT 

<213> Nicotiana alata 
<400> 12 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly 
15 10 



<210> 13 
<211> 6 
<212> PRT 



<213> Nicotiana alata 



<400> 13 



Lys Ala Cys Thr Leu Asn 
1 5 



<210> 14 

<211> 5 

<212> PRT 

<213> Nicotiana alata 

<400> 14 

Glu Glu Lys Lys Asn 
1 5 



<210> 15 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

Sequence: Protease - sensitive peptide 

<220> 

<221> UNSURE 
<222> (1) . . (2) 

<223> Xaa can be any amino acid 

<400> 15 

Xaa Xaa Asn Asp 
1 



<210> 16 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 

Sequence : Protease- sensitive peptide 



<220> 

<221> UNSURE 
<222> (1) 

<223> Xaa is lie or Val 



<220> 

<221> UNSURE 



<222> (4) 

<223> Xaa is Arg or Leu 



<220> 

<221> UNSURE 
<222> (5) 

<223> Xaa is Ser or Ala 



<220> 

<221> UNSURE 
<222> (21) 

<223> Xaa is Thr or Lys 
<400> 16 

Xaa Cys Pro Xaa Xaa Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn 
15 io 15 

Cys Cys Ala Gly Xaa Lys Gly 
20 



